Search CORE

156 research outputs found

Controlling Output Length in Neural Encoder-Decoders

Author: Kikuchi Yuta
Neubig Graham
Okumura Manabu
Sasano Ryohei
Takamura Hiroya
Publication venue
Publication date: 01/01/2016
Field of study

Neural encoder-decoder models have shown great success in many sequence generation tasks. However, previous work has not investigated situations in which we would like to control the length of encoder-decoder outputs. This capability is crucial for applications such as text summarization, in which we have to generate concise summaries with a desired length. In this paper, we propose methods for controlling the output sequence length for neural encoder-decoder models: two decoding-based methods and two learning-based methods. Results show that our learning-based methods have the capability to control length without degrading summary quality in a summarization task.Comment: 11 pages. To appear in EMNLP 201

arXiv.org e-Print Archive

Crossref

Extracting Semantic Orientations of Words using Spin Model

Author: Hiroya Takamura
Manabu Okumura
Takashi Inui
Publication venue
Publication date: 01/01/2005
Field of study

We propose a method for extracting semantic orientations of words: desirable or undesirable. Regarding semantic orientations as spins of electrons, we use the mean field approximation to compute the approximate probability function of the system instead of the intractable actual probability function. We also propose a criterion for parameter selection on the basis of magnetization. Given only a small number of seed words, the proposed method extracts semantic orientations with high accuracy in the experiments on English lexicon. The result is comparable to the best value ever reported.

CiteSeerX

Crossref

Semi-Supervised Learning for Blog Classification.

Author: Daisuke Ikeda
Hiroya Takamura
Manabu Okumura
Publication venue
Publication date: 01/01/2008
Field of study

Abstract Blog classification (e.g., identifying bloggers' gender or age) is one of the most interesting current problems in blog analysis. Although this problem is usually solved by applying supervised learning techniques, the large labeled dataset required for training is not always available. In contrast, unlabeled blogs can easily be collected from the web. Therefore, a semi-supervised learning method for blog classification, effectively using unlabeled data, is proposed. In this method, entries from the same blog are assumed to have the same characteristics. With this assumption, the proposed method captures the characteristics of each blog, such as writing style and topic, and uses these characteristics to improve the classification accuracy

CiteSeerX

Learning to Select, Track, and Generate for Data-to-Text

Author: Aramaki Eiji
Ishigaki Tatsuya
Iso Hayate
Kobayashi Ichiro
Miyao Yusuke
Noji Hiroshi
Okazaki Naoaki
Takamura Hiroya
Uehara Yui
Publication venue: 'Association for Computational Linguistics (ACL)'
Publication date: 01/01/2019
Field of study

We propose a data-to-text generation model with two modules, one for tracking and the other for text generation. Our tracking module selects and keeps track of salient information and memorizes which record has been mentioned. Our generation module generates a summary conditioned on the state of tracking module. Our model is considered to simulate the human-like writing process that gradually selects the information by determining the intermediate variables while writing the summary. In addition, we also explore the effectiveness of the writer information for generation. Experimental results show that our model outperforms existing models in all evaluation metrics even without writer information. Incorporating writer information further improves the performance, contributing to content planning and surface realization.Comment: ACL 201

arXiv.org e-Print Archive

Crossref

カンドウミャクケイユ IVH reservoir リュウチノケイケン

Author: Hasegawa Masakazu
Hiramatsu Kazuhide
Miyamoto Noriyuki
Saitoh Hiroya
Takamura Akio
Takeuchi Shuuhei
サイトウヒロヤ
タカムラアキオ
タケウチシュウヘイ
ハセガワマサカズ
ヒラマツカズヒデ
ミヤモトノリユキ
宮本憲幸
平松一秀
武内周平
長谷川雅一
高邑明夫
齋藤博哉
Publication venue: 日本医学放射線学会
Publication date: 25/12/2002
Field of study

Osaka University Knowledge Archive

Gore-Tex covered EMS ニヨルタンドウナイロウジュツ

Author: Horio Keiji
Saito Hiroya
Sakurai Yasuo
Takamura Akio
サイトウヒロヤ
サクライヤスオ
タカムラアキオ
ホリオケイジ
堀尾圭司
桜井康雄
高邑明夫
齋藤博哉
Publication venue: 日本医学放射線学会
Publication date: 25/02/1994
Field of study

Osaka University Knowledge Archive